NSF PAR Search | NSF Public Access Repository

NeuralFeels with neural fields: Visuotactile perception for in-hand manipulation

https://doi.org/10.1126/scirobotics.adl0628

Suresh, Sudharshan; Qi, Haozhi; Wu, Tingfan; Fan, Taosha; Pineda, Luis; Lambeta, Mike; Malik, Jitendra; Kalakrishnan, Mrinal; Calandra, Roberto; Kaess, Michael; et al. (November 2024, Science Robotics)
Yashinski, Melisa (Ed.)
To achieve human-level dexterity, robots must infer spatial awareness from multimodal sensing to reason over contact interactions. During in-hand manipulation of novel objects, such spatial awareness involves estimating the object’s pose and shape. The status quo for in-hand perception primarily uses vision and is restricted to tracking a priori known objects. Moreover, visual occlusion of objects in hand is imminent during manipulation, preventing current systems from pushing beyond tasks without occlusion. We combined vision and touch sensing on a multifingered hand to estimate an object’s pose and shape during in-hand manipulation. Our method, NeuralFeels, encodes object geometry by learning a neural field online and jointly tracks it by optimizing a pose graph problem. We studied multimodal in-hand perception in simulation and the real world, interacting with different objects via a proprioception-driven policy. Our experiments showed final reconstruction F-scores of 81% and average pose drifts of 4.7 millimeters, which was further reduced to 2.3 millimeters with known object models. In addition, we observed that, under heavy visual occlusion, we could achieve improvements in tracking up to 94% compared with vision-only methods. Our results demonstrate that touch, at the very least, refines and, at the very best, disambiguates visual estimates during in-hand manipulation. We release our evaluation dataset of 70 experiments, FeelSight, as a step toward benchmarking in this domain. Our neural representation driven by multimodal sensing can serve as a perception backbone toward advancing robot dexterity.
Full Text Available
Where2Act: From Pixels to Actions for Articulated 3D Objects

https://doi.org/10.1109/ICCV48922.2021.00674

Mo, Kaichun; Guibas, Leonidas; Mukadam, Mustafa; Gupta, Abhinav; Tulsiani, Shubham (October 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV))

One of the fundamental goals of visual perception is to allow agents to meaningfully interact with their environment. In this paper, we take a step towards that long-term goal – we extract highly localized actionable information related to elementary actions such as pushing or pulling for articulated objects with movable parts. For example, given a drawer, our network predicts that applying a pulling force on the handle opens the drawer. We propose, discuss, and evaluate novel network architectures that given image and depth data, predict the set of actions possible at each pixel, and the regions over articulated parts that are likely to move under the force. We propose a learning-from-interaction framework with an online data sampling strategy that allows us to train the network in simulation (SAPIEN) and generalizes across categories. Check the website for code and data release.
Full Text Available
STEAP: simultaneous trajectory estimation and planning

https://doi.org/10.1007/s10514-018-9770-1

Mukadam, Mustafa; Dong, Jing; Dellaert, Frank; Boots, Byron (February 2019, Autonomous Robots)

Full Text Available
Sparse Gaussian Processes on Matrix Lie Groups: A Unified Framework for Optimizing Continuous-Time Trajectories

Dong, Jing; Mukadam, Mustafa; Boots, Byron; Dellaert, Frank (January 2018, IEEE International Conference on Robotics and Automation)

Full Text Available
Simultaneous Trajectory Estimation and Planning via Probabilistic Inference

https://doi.org/10.15607/RSS.2017.XIII.025

Mukadam, Mustafa; Dong, Jing; Dellaert, Frank; Boots, Byron (July 2017, Robotics: Science and Systems)

Full Text Available
Learning Generalizable Robot Skills from Demonstrations in Cluttered Environments

Rana, M. Asif; Mukadam, Mustafa; Ahmadzadeh, S. Reza; Boots, Byron; Chernova, Sonia (January 2018, Proceedings of the International Conference on Intelligent Robots and Systems)

Full Text Available
Approximately optimal continuous-time motion planning and control via Probabilistic Inference

https://doi.org/10.1109/ICRA.2017.7989082

Mukadam, Mustafa; Cheng, Ching-An; Yan, Xinyan; Boots, Byron (May 2017, International Conference on Robotics and Automation)

Full Text Available